CDS
Accession Number | TCMCG024C24151 |
gbkey | CDS |
Protein Id | XP_021999029.1 |
Location | join(34250548..34250622,34251801..34251911,34253594..34253692,34253824..34253886,34254135..34254212,34254413..34254587,34254830..34255014,34255085..34255154,34255248..34255370,34255608..34255685,34257147..34257379,34257627..34257881,34257982..34258160,34259155..34259437,34259518..34259646,34260078..34260239,34260699..34260842,34261023..34261232,34261310..34261564,34261924..34262088) |
Gene | LOC110895945 |
GeneID | 110895945 |
Organism | Helianthus annuus |
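
The Location field above lists the exon segments that are spliced together to form the coding sequence. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for `join(...)` feature locations), the segments can be parsed and their lengths summed. The helper names `parse_join` and `spliced_length` are illustrative and are not part of this record or of any library.

```python
import re

# Location string copied verbatim from the record's CDS "Location" field.
LOCATION = (
    "join(34250548..34250622,34251801..34251911,34253594..34253692,"
    "34253824..34253886,34254135..34254212,34254413..34254587,"
    "34254830..34255014,34255085..34255154,34255248..34255370,"
    "34255608..34255685,34257147..34257379,34257627..34257881,"
    "34257982..34258160,34259155..34259437,34259518..34259646,"
    "34260078..34260239,34260699..34260842,34261023..34261232,"
    "34261310..34261564,34261924..34262088)"
)


def parse_join(location):
    """Return the (start, end) pairs found in a join(...) location string."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Sum segment lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
cds_length = spliced_length(segments)

# Number of segments, spliced length in nt, and codons excluding the stop codon.
print(len(segments), cds_length, cds_length // 3 - 1)
```

Under that assumption the 20 segments sum to 3072 nt, that is 1024 codons: 1023 amino acids plus one stop codon, which agrees with the protein length given in this record.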
Protein
Length | 1023 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA396063 |
db_source | XM_022143337.2 |
Definition | protein ALWAYS EARLY 3 isoform X1 [Helianthus annuus] |
EGGNOG-MAPPER Annotation
COG_category | BDT |
Description | SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | ko00000, ko00001 |
KEGG_ko | ko:K21773 |
EC | - |
KEGG_Pathway | ko04218, map04218 |
GOs | - |
Sequence
CDS: ATGCGAATCAAGCACTTACCTGAGGGGAAATGTAACATGGTGGATATTCAAATTCAACGTAGAAAACCTACCTGGGAAGATCTCATGGGCCCACCAAGACACAGAAGTGTGAACAAGCGGTATCTGTATGGTGATGAAGCATCTCCAACTAAAGATGGAGGTAACACCAACAAGAAAAACCAACGAAAAAAAAAGTTGTCGGACATGCTGGGACCACTTTGGAGCAAGGAAGAGCTGGAACATTTTTACGAAGCATATCGTAAGCACGGGAGAGACTGGAAAAAGGTGGCTGCTGTTCTCCGAAACCGCTCTGTGGATATGGTGGAGGCTGTTTATTCTCTCAATAAAGCATACTTATCTCTCCCAGAGGGCACAGCCTCTGTTGTGGGCTTCATTGCTATGATGACTGATTATTACAGCAACATGGATGACAGGGATAGTGAACAGGAAAGGAATGGTGGTGCGGGACCATCTCGGAAGCCTCAAAAACGCATGCAGCGCAAAGTTCATGAGACCACATCCAAAGGCTTACGGTTGCATCCGGAGGCTGTTCCAACAGACTATGGTTTCTTGTCATTATTAAAAAAGAAGCGCTCTGCTGGAAGTCGACCTCGTGTTGTTGGAAAACGGACACCGCGTTTCCCTGTCTCACATTCATATGGAAATGTCAAGAAAGGTCAAAAACCAAGGGAAAACGATGATGAAGATGTTGCTCATAGTATAGCAATGACACTGGCAAAGGCTTCTAAAAGAGGTGGTTCCCCTAGTCAGAGTGCTGAAAGAATGTATGATGAATCAGATGATGAGATTGAAGGTAGCATGGAAGCTGATAATACAGAATTGTCTCGATACAAGAATTACATACGGGAAGCCTGTAGTGGCACCGAAGGGCAGAATTTAGGTTCTATACGAAGAAATTTTAATATTAAAGTTGCTGATGCAAAACCTTCAGGGTCTTCTGGGGTTTCCAACAAAAGAGATGAAAATTCTGCCTTTGATGCACTGGAAACTTTGGCAAATTTGTCTTTGTTGATTTCACCCGAAGCAAATGAAAATGGAAAAGACGAAGCTGTTGATGACTCTCATCTACTGCTAAATATACCTGCAAGTCGCAAAAGAGAGAAGCGTAATGGGAAACACTTCATCTCTAGATCAGAGGCTGCTGATGACAAACCTCAGGCATCTGCTAAAAACATGGTTGATGACACAAATGCTGCTTCTGAAGCAAAGGAATTCCATCCGTTAACGACTAAAGTTTCTAGAAAGAGACAGAAGATCGTAGCATCTAGAATTTCAAAAGATGAAGCCTCTGTTGATACAACCCCTCAGTTGGCCAAGACGGGTCAACATGTTACTTCTAGTATTCACTCAAGAAGGGAATCACATGTTTCTGAGATGAAACCATCTAACATACGTACCAAAGCCAGAAACAAGCGTAAAATGTACAAACCAAAGGCTCTTGAAATTTCTAATTTATCCGATGTAGTTGTAGGCAAGTCAAACGTACACTTGCCTTCTCTCAGTCAAAGAATAGACAAACTTAAGGGAGAGCTTTCTAATTGTTTGTCAAATCAACTAATGAGAAGATGGTGTGCATTTGAGTGGTTTTATAGTGCAATAGATGAACTTTGGTTTGCCAAAAGGGAGTTTGTAGAGTACTTGTATCATGTTGGTTTAGGTCATGTTCCAAGACTAACACGTGTTGAATGGGGTGTAATCAGAAGTTCTCTTGGTAAGCCACGGAGGTTCTCACATCAATTTCTAAAGGAAGAAAAAGACAAACTTTATCAGTTTCGGGATTCTGTGAGGACACATTATACTGAACTTCGTGCTGGTAGTAGAGATGGATTGCCAACAGATCTCGCAAGGCCATTAGCCGTAGGACAGCGTGTCTTAGCAATTTATCCAAAAACAAGAGAGATTCACGATGGAACTGTTCTGACAGTAGATCATGACAGGTGTCGTGTTCAATTTAATCATCCTGAGCTAGGTGCT
GAAATAATCATGGATACTGATTGCATGCCATTGAATCCCATTGAGAATTTGCCTGCATCACTGATGCCTCGAAAACCTATTGACAAAGCCAATGGGCTGGGAAAAGGTCCAAGGCATGGAAATTTAGAGAACGTTGATGAGGTTGCTTCAGCTAAATCTGAATTCCAATCTAGAAACGGACCAAGGGATACACTCTCAGATCATCATGCAGCATACTCTGTGCCTGGTACAACGGCTAAATTTCAGGCAAAAGAAGCTGATGTTGAAGCAATTGCCGAGCTGACCCGTGCTCTTGACAAAAAGGAGGCTGTAGTTATTGAGTTGAGGCGAATGAACGACGATGTGCTGGAAAACCAAACTGTAGGTGACTCTGCTCTTAAGGATTCAAATGCTTTCAAAAAGCAATATGCAGCTGTTCTTGTACATTTAAATGATGCCAATGCTCAGGTTTCTTCTGCTTTATTTCGCTTGAGGCAACGCAACACATATCAGCAAAACTACCACTTTAACTTGCCAAAGGCAGTTGGTGATTTAAGTGACCATGGTGCCATCTTAGAAGACACTGCAAATCACACCCAAGAATCACAAGGTCAGGTCAATGAGATTGTTGAGATATCACGCACCAAGGCTCGTACAATGGTAGAAATAGCTATACAGGCATTGTCATCGCTGAAGCTCGATCCAACAGTTGAGATTGACAAGGCTGTTGATTACATGAATGATCTGCTTCCGTCGGATTATTCCTGTATTCCACCCATCCGGCCTGTTCCTCGTTCCAGTTTGTCATTGCATCAACAAGTCTCCACCGTAAAAGCACCTGATTCAAAGCCCATGAATTCATCAGATTTTAATAAATCATCAGTCCCCTCTGAACTGATTACTCAGTGTGTTGCTACCCTCTTCATCATTCAGAAATGTACAGAAAGACAGTTTCCTCCAGCTGAAGTTGCAAGTATATTGGATTCTGCTGTTTCTAGTTTACAACCTTGCTCTGCTCAGAATCTGCAGGTTTATGCTGAGATTCAAAAATACATGGGGATTATCAAGAACCAGATTTTAGCTCTCGTACCGACTTAA |
Protein: MRIKHLPEGKCNMVDIQIQRRKPTWEDLMGPPRHRSVNKRYLYGDEASPTKDGGNTNKKNQRKKKLSDMLGPLWSKEELEHFYEAYRKHGRDWKKVAAVLRNRSVDMVEAVYSLNKAYLSLPEGTASVVGFIAMMTDYYSNMDDRDSEQERNGGAGPSRKPQKRMQRKVHETTSKGLRLHPEAVPTDYGFLSLLKKKRSAGSRPRVVGKRTPRFPVSHSYGNVKKGQKPRENDDEDVAHSIAMTLAKASKRGGSPSQSAERMYDESDDEIEGSMEADNTELSRYKNYIREACSGTEGQNLGSIRRNFNIKVADAKPSGSSGVSNKRDENSAFDALETLANLSLLISPEANENGKDEAVDDSHLLLNIPASRKREKRNGKHFISRSEAADDKPQASAKNMVDDTNAASEAKEFHPLTTKVSRKRQKIVASRISKDEASVDTTPQLAKTGQHVTSSIHSRRESHVSEMKPSNIRTKARNKRKMYKPKALEISNLSDVVVGKSNVHLPSLSQRIDKLKGELSNCLSNQLMRRWCAFEWFYSAIDELWFAKREFVEYLYHVGLGHVPRLTRVEWGVIRSSLGKPRRFSHQFLKEEKDKLYQFRDSVRTHYTELRAGSRDGLPTDLARPLAVGQRVLAIYPKTREIHDGTVLTVDHDRCRVQFNHPELGAEIIMDTDCMPLNPIENLPASLMPRKPIDKANGLGKGPRHGNLENVDEVASAKSEFQSRNGPRDTLSDHHAAYSVPGTTAKFQAKEADVEAIAELTRALDKKEAVVIELRRMNDDVLENQTVGDSALKDSNAFKKQYAAVLVHLNDANAQVSSALFRLRQRNTYQQNYHFNLPKAVGDLSDHGAILEDTANHTQESQGQVNEIVEISRTKARTMVEIAIQALSSLKLDPTVEIDKAVDYMNDLLPSDYSCIPPIRPVPRSSLSLHQQVSTVKAPDSKPMNSSDFNKSSVPSELITQCVATLFIIQKCTERQFPPAEVASILDSAVSSLQPCSAQNLQVYAEIQKYMGIIKNQILALVPT |
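
The CDS and Protein entries above can be checked against each other by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and translates only the first 30 and the last 12 nucleotides of the CDS as copied from this record. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA string; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the CDS, copied from the record.
CDS_START = "ATGCGAATCAAGCACTTACCTGAGGGGAAA"
CDS_END = "GTACCGACTTAA"

print(translate(CDS_START))  # MRIKHLPEGK
print(translate(CDS_END))    # VPT*
```

The two fragments translate to `MRIKHLPEGK` and `VPT*`, matching the first ten and last three residues of the Protein entry, followed by the stop codon. Passing the full CDS string to the same function would extend the check to the whole sequence.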